Spatio-textual Indexing for Geographical Search on the Web
نویسندگان
چکیده
Many web documents refer to specific geographic localities and many people include geographic context in queries to web search engines. Standard web search engines treat the geographical terms in the same way as other terms. This can result in failure to find relevant documents that refer to the place of interest using alternative related names, such as those of included or nearby places. This can be overcome by associating text indexing with spatial indexing methods that exploit geo-tagging procedures to categorise documents with respect to geographic space. We describe three methods for spatio-textual indexing based on multiple spatially indexed text indexes, attaching spatial indexes to the document occurrences of a text index, and merging text index access results with results of access to a spatial index of documents. These schemes are compared experimentally with a conventional text index search engine, using a collection of geo-tagged web documents, and are shown to be able to compete in speed and storage performance with pure text indexing.
منابع مشابه
The SPIRIT Spatial Search Engine: Architecture, Ontologies and Spatial Indexing
The SPIRIT search engine provides a test bed for the development of web search technology that is specialised for access to geographical information. Major components include the user interface, geographical ontology, maintenance and retrieval functions for a test collection of web documents, textual and spatial indexes, relevance ranking and metadata extraction. Here we summarise the functiona...
متن کاملIndexation sémantique et recherche d'information interactive
Among the various facets of Information Retrieval in textual data, the search for information located in space and time constitutes a full research field. Indeed, it requires, for indexing as for retrieval, specific linguistic analyses and resources. The present paper roots in the GéoSem project, whose aim is to develop advanced, semantic-based methods for geographical documents retrieval. Toda...
متن کاملFAST: Frequency-Aware Spatio-Textual Indexing for In-Memory Continuous Filter Query Processing
The ubiquity of spatio-textual data comes from the popularity of GPS-enabled smart devices, e.g., smartphones. These devices provide a platform that supports a wide range of applications that generate and process spatio-textual data. These applications include social networks, micro-blogs, web-search for local attractions and events, and location-aware ad targeting. These applications need to p...
متن کاملDemo Paper: A Spatio-Temporal-Textual Crime Search Engine
This paper proposes a STT(spatio-temporal-textual) search engine for extracting, indexing, querying and visualizing crime information. Until recently, it’s a labor-intensive work to identify crime entities, cluster similar suspect activities, and discover patterns from massive online collections. It’s a big challenge to reveal inherent ST(spatio-temporal) correlations among mass crime informati...
متن کاملSKIF-P: a point-based indexing and ranking of web documents for spatial-keyword search
There is a significant commercial and research interest in location-based web search engines. Given a number of search keywords and one or more locations (geographical points) that a user is interested in, a location-based web search retrieves and ranks the most textually and spatially relevant web pages. In this type of search, both the spatial and textual information should be indexed. Curren...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005